Building a Discourse-Tagged Corpus in the Framework of Rhetorical Structure Theory

نویسندگان

  • Lynn Carlson
  • Daniel Marcu
  • Mary Ellen Okurovsky
چکیده

We describe our experience in developing a discourse-annotated corpus for community-wide use. Working in the framework of Rhetorical Structure Theory, we were able to create a large annotated resource with very high consistency, using a well-defined methodology and protocol. This resource is made publicly available through the Linguistic Data Consortium to enable researchers to develop empirically grounded, discourse-specific applications.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Building Chinese Discourse Corpus with Connective-driven Dependency Tree Structure

In this paper, we propose a Connectivedriven Dependency Tree (CDT) scheme to represent the discourse rhetorical structure in Chinese language, with elementary discourse units as leaf nodes and connectives as non-leaf nodes, largely motivated by the Penn Discourse Treebank and the Rhetorical Structure Theory. In particular, connectives are employed to directly represent the hierarchy of the tree...

متن کامل

The Prosody of Discourse Structure and Content in the Production of Persian EFL Learners

The present research addressed the prosodic realization of global and local text structure and content in the spoken discourse data produced by Persian EFL learners. Two newspaper articles were analyzed using Rhetorical Structure Theory. Based on these analyses, the global structure in terms of hierarchical level, the local structure in terms of the relative importance of text segments and the ...

متن کامل

A Cross-Disciplinary Genre Analysis of Rhetorical Features of Research Article Introductions Written by Iranians

The notion of genre has received a great deal of attention both in discourse analytic studies as well as in the field of ESP/EAP course design. The present paper has attempted to use genre analysis to account for the rhetorical features of research article introductions written by Iranian academics in two disciplinary fields of Education and Economics. The corpus comprised 40 research article i...

متن کامل

A Corpus Study of Referential Choice: The Role of Rhetorical Structure

This study shares the view that reference in discourse is influenced by the distance to prior mentions of the referent in the discourse. Kibrik (1996, 1999) suggested a measurement of rhetorical distance to assess this factor. In this paper we address three complications created by that methodology when applied to a large corpus of written newspaper texts. These problems include: difference bet...

متن کامل

A Corpus-based Study of Lexical Bundles in Discussion Section of Medical Research Articles

There has been increasing interest in utilizing corpora in linguistic research and pedagogy in recent years. Rhetorical organization of different sections of research articles may appear similar in various disciplines, but close examination may show subtle differences nonetheless. One of the features that has been at the center of attention especially in recent years is the idiomaticity of a di...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001